Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 3237593 |
| Missing cells | 4248777 |
| Missing cells (%) | 10.9% |
| Duplicate rows | 23514 |
| Duplicate rows (%) | 0.7% |
| Total size in memory | 296.4 MiB |
| Average record size in memory | 96.0 B |
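The percentages in this table can be re-derived from the raw counts. The sketch below is a plain arithmetic check, assuming the missing-cell share is taken over every cell (observations × variables); the variable names are illustrative and not part of the report.

```python
# Counts copied from the Dataset statistics table.
n_variables = 12
n_observations = 3_237_593
missing_cells = 4_248_777
duplicate_rows = 23_514

# Assumption: the missing-cell share is computed over all cells in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
```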
Variable types
| Categorical | 3 |
|---|---|
| Numeric | 8 |
| Unsupported | 1 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 23514 (0.7%) duplicate rows | Duplicates |
| SEXO is highly imbalanced (50.3%) | Imbalance |
| SUICIDIO is highly imbalanced (93.6%) | Imbalance |
| ASSISTMED has 1475120 (45.6%) missing values | Missing |
| ESC has 539840 (16.7%) missing values | Missing |
| ESTCIV has 188999 (5.8%) missing values | Missing |
| HORAOBITO has 196892 (6.1%) missing values | Missing |
| NATURAL has 957086 (29.6%) missing values | Missing |
| OCUP has 760072 (23.5%) missing values | Missing |
| RACACOR has 130117 (4.0%) missing values | Missing |
| HORAOBITO is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| ESC has 64843 (2.0%) zeros | Zeros |
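The imbalance figures are a score, not the share of the majority class: SEXO is split roughly 55/45 yet scores 50.3%. Both reported values are consistent with one minus the normalized Shannon entropy of the class counts. The sketch below assumes that definition (the report itself does not state it) and checks it against the SEXO and SUICIDIO counts listed further down; `imbalance_score` is an illustrative name.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k), with H the Shannon entropy of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log(k)

# Class counts copied from the SEXO and SUICIDIO sections of this report.
sexo = [1_797_069, 1_439_711, 687, 126]
suicidio = [3_212_973, 24_620]

print(f"SEXO: {imbalance_score(sexo):.1%}")
print(f"SUICIDIO: {imbalance_score(suicidio):.1%}")
```

Under this definition the two rare SEXO codes (0 and 9) drive the score: they count as classes in the normalization but contribute almost no entropy.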
Reproduction
| Analysis started | 2024-04-21 16:47:13.784793 |
|---|---|
| Analysis finished | 2024-04-21 16:48:17.173318 |
| Duration | 1 minute and 3.39 seconds |
| Software version | ydata-profiling v4.7.0 |
| Download configuration | config.json |
ASSISTMED
Categorical
MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1475120 |
| Missing (%) | 45.6% |
| Memory size | 24.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 5287419 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 2.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.0 | 1387317 | 42.9% |
| 2.0 | 193587 | 6.0% |
| 9.0 | 181569 | 5.6% |
| (Missing) | 1475120 | 45.6% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 1.0 | 1387317 | 78.7% |
| 2.0 | 193587 | 11.0% |
| 9.0 | 181569 | 10.3% |
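The two ASSISTMED frequency tables report different percentages for the same counts (for example 6.0% and 11.0% for value 2.0). The numbers are consistent with the first table dividing by all rows and the plot table dividing by non-missing rows only. The sketch below reproduces both from the counts; the choice of denominators is an inference from the figures, not something the report states.

```python
# Counts copied from the ASSISTMED Common Values table.
counts = {"1.0": 1_387_317, "2.0": 193_587, "9.0": 181_569}
missing = 1_475_120

n_present = sum(counts.values())   # rows with a recorded value
n_total = n_present + missing      # all rows in the dataset

for value, n in counts.items():
    share_all = n / n_total        # denominator: every row
    share_present = n / n_present  # denominator: non-missing rows only
    print(f"{value}: {share_all:.1%} of all rows, {share_present:.1%} of non-missing rows")
```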
Most occurring characters
| Value | Count | Frequency (%) |
| . | 1762473 | |
| 0 | 1762473 | |
| 1 | 1387317 | |
| 2 | 193587 | 3.7% |
| 9 | 181569 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5287419 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 1762473 | |
| 0 | 1762473 | |
| 1 | 1387317 | |
| 2 | 193587 | 3.7% |
| 9 | 181569 | 3.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5287419 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 1762473 | |
| 0 | 1762473 | |
| 1 | 1387317 | |
| 2 | 193587 | 3.7% |
| 9 | 181569 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5287419 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 1762473 | |
| 0 | 1762473 | |
| 1 | 1387317 | |
| 2 | 193587 | 3.7% |
| 9 | 181569 | 3.4% |
DTOBITO
Real number (ℝ)
| Distinct | 4383 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15758464 |
| Minimum | 1012006 |
| Maximum | 31122017 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.7 MiB |
Quantile statistics
| Minimum | 1012006 |
|---|---|
| 5-th percentile | 2072008 |
| Q1 | 8082007 |
| median | 16022016 |
| Q3 | 23102007 |
| 95-th percentile | 29122013 |
| Maximum | 31122017 |
| Range | 30110011 |
| Interquartile range (IQR) | 15020000 |
Descriptive statistics
| Standard deviation | 8789155.4 |
|---|---|
| Coefficient of variation (CV) | 0.55774189 |
| Kurtosis | -1.1896495 |
| Mean | 15758464 |
| Median Absolute Deviation (MAD) | 7920000 |
| Skewness | 0.01314967 |
| Sum | 5.1019492 × 10¹³ |
| Variance | 7.7249252 × 10¹³ |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 19012015 | 1050 | < 0.1% |
| 14062016 | 1045 | < 0.1% |
| 7022014 | 1023 | < 0.1% |
| 9022014 | 1021 | < 0.1% |
| 17062016 | 1020 | < 0.1% |
| 10022014 | 1019 | < 0.1% |
| 13062016 | 1019 | < 0.1% |
| 16062016 | 1006 | < 0.1% |
| 24062016 | 997 | < 0.1% |
| 8022014 | 997 | < 0.1% |
| Other values (4373) | 3227396 | 99.7% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 1012006 | 662 | < 0.1% |
| 1012007 | 624 | < 0.1% |
| 1012008 | 825 | < 0.1% |
| 1012009 | 695 | < 0.1% |
| 1012010 | 752 | < 0.1% |
| 1012011 | 771 | < 0.1% |
| 1012012 | 687 | < 0.1% |
| 1012013 | 787 | < 0.1% |
| 1012014 | 859 | < 0.1% |
| 1012015 | 840 | < 0.1% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 31122017 | 746 | < 0.1% |
| 31122016 | 768 | < 0.1% |
| 31122015 | 723 | < 0.1% |
| 31122014 | 748 | < 0.1% |
| 31122013 | 730 | < 0.1% |
| 31122012 | 693 | < 0.1% |
| 31122011 | 680 | < 0.1% |
| 31122010 | 724 | < 0.1% |
| 31122009 | 663 | < 0.1% |
| 31122008 | 622 | < 0.1% |
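DTOBITO is profiled as a real number, so its mean, quantiles and variance carry no calendar meaning. The values look like dates written as DDMMYYYY with the leading zero dropped by the numeric cast (1012006 for 01/01/2006, 31122017 for 31/12/2017). The sketch below parses them under that assumed layout; `parse_dtobito` is an illustrative helper, not part of the report.

```python
from datetime import date

def parse_dtobito(value):
    """Parse an integer such as 1012006 as a date, assuming a DDMMYYYY layout."""
    text = f"{int(value):08d}"  # restore the leading zero lost by the numeric cast
    day, month, year = int(text[:2]), int(text[2:4]), int(text[4:])
    return date(year, month, day)

first = parse_dtobito(1012006)   # minimum value in the table
last = parse_dtobito(31122017)   # maximum value in the table
print(first, last, (last - first).days + 1)
```

The span from the first to the last date covers as many calendar days as the 4383 distinct values reported for this variable, which supports the DDMMYYYY reading.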
ESC
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 539840 |
| Missing (%) | 16.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.6110956 |
| Minimum | 0 |
| Maximum | 9 |
| Zeros | 64843 |
| Zeros (%) | 2.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 9 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.5378352 |
|---|---|
| Coefficient of variation (CV) | 0.70278815 |
| Kurtosis | 0.36373429 |
| Mean | 3.6110956 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.1959698 |
| Sum | 9741844 |
| Variance | 6.4406076 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 722256 | 22.3% |
| 3 | 623304 | 19.3% |
| 9 | 409313 | 12.6% |
| 4 | 390218 | 12.1% |
| 1 | 314091 | 9.7% |
| 5 | 173728 | 5.4% |
| 0 | 64843 | 2.0% |
| (Missing) | 539840 | 16.7% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 64843 | 2.0% |
| 1 | 314091 | 9.7% |
| 2 | 722256 | 22.3% |
| 3 | 623304 | 19.3% |
| 4 | 390218 | 12.1% |
| 5 | 173728 | 5.4% |
| 9 | 409313 | 12.6% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 9 | 409313 | 12.6% |
| 5 | 173728 | 5.4% |
| 4 | 390218 | 12.1% |
| 3 | 623304 | 19.3% |
| 2 | 722256 | 22.3% |
| 1 | 314091 | 9.7% |
| 0 | 64843 | 2.0% |
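ESC holds category codes, so the reported mean of 3.61 depends on how the codes happen to be numbered. The sketch below recomputes the mean from the value counts, once over all codes and once with code 9 excluded. Treating 9 as an "unknown" marker is an assumption (the report does not define the codes); the point is only to show how much a single sentinel code moves the statistic.

```python
# Value counts copied from the ESC table.
esc_counts = {0: 64_843, 1: 314_091, 2: 722_256, 3: 623_304,
              4: 390_218, 5: 173_728, 9: 409_313}

def mean_of_codes(counts, exclude=()):
    """Mean of the coded values, weighted by count, skipping excluded codes."""
    kept = {code: n for code, n in counts.items() if code not in exclude}
    return sum(code * n for code, n in kept.items()) / sum(kept.values())

mean_all = mean_of_codes(esc_counts)                 # matches the reported mean
mean_known = mean_of_codes(esc_counts, exclude={9})  # assumption: 9 = unknown
print(f"{mean_all:.4f} with code 9, {mean_known:.4f} without")
```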
ESTCIV
Real number (ℝ)
MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 188999 |
| Missing (%) | 5.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.4289184 |
| Minimum | 1 |
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.4293908 |
|---|---|
| Coefficient of variation (CV) | 0.5884886 |
| Kurtosis | 9.5578298 |
| Mean | 2.4289184 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.5790639 |
| Sum | 7404786 |
| Variance | 2.043158 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1170466 | 36.2% |
| 3 | 853982 | 26.4% |
| 1 | 679997 | 21.0% |
| 4 | 229326 | 7.1% |
| 9 | 82623 | 2.6% |
| 5 | 32200 | 1.0% |
| (Missing) | 188999 | 5.8% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 679997 | 21.0% |
| 2 | 1170466 | 36.2% |
| 3 | 853982 | 26.4% |
| 4 | 229326 | 7.1% |
| 5 | 32200 | 1.0% |
| 9 | 82623 | 2.6% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 9 | 82623 | 2.6% |
| 5 | 32200 | 1.0% |
| 4 | 229326 | 7.1% |
| 3 | 853982 | 26.4% |
| 2 | 1170466 | 36.2% |
| 1 | 679997 | 21.0% |
HORAOBITO
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 196892 |
|---|---|
| Missing (%) | 6.1% |
| Memory size | 24.7 MiB |
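HORAOBITO was rejected as an unsupported type, so it needs cleaning before it can be profiled. The sample rows in this report show values such as 130.0, 1130.0 and 2230.0, which look like clock times written as HHMM. The sketch below converts such values to `datetime.time` under that assumed layout and returns `None` for anything it cannot interpret; `parse_horaobito` is an illustrative helper.

```python
from datetime import time

def parse_horaobito(value):
    """Convert a value such as 130.0 or '1130' to a time, assuming HHMM layout."""
    try:
        number = int(float(value))
    except (TypeError, ValueError, OverflowError):
        return None  # None, NaN, infinity, or non-numeric text
    hours, minutes = divmod(number, 100)
    if not (0 <= hours <= 23 and 0 <= minutes <= 59):
        return None  # out-of-range values are treated as invalid
    return time(hours, minutes)

for raw in (130.0, "1130", 2230.0, float("nan"), 2575):
    print(raw, parse_horaobito(raw))
```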
IDADE
Real number (ℝ)
| Distinct | 250 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 649 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 461.14141 |
| Minimum | 0 |
| Maximum | 999 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 421 |
| Q1 | 454 |
| median | 469 |
| Q3 | 481 |
| 95-th percentile | 492 |
| Maximum | 999 |
| Range | 999 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 54.780199 |
|---|---|
| Coefficient of variation (CV) | 0.11879262 |
| Kurtosis | 41.925204 |
| Mean | 461.14141 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.74688913 |
| Sum | 1.4926889 × 10⁹ |
| Variance | 3000.8702 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 481 | 75080 | 2.3% |
| 480 | 75014 | 2.3% |
| 479 | 74653 | 2.3% |
| 478 | 74377 | 2.3% |
| 482 | 73590 | 2.3% |
| 477 | 73515 | 2.3% |
| 483 | 72136 | 2.2% |
| 476 | 71786 | 2.2% |
| 484 | 70603 | 2.2% |
| 475 | 69502 | 2.1% |
| Other values (240) | 2506688 | 77.4% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3 | < 0.1% |
| 1 | 367 | < 0.1% |
| 2 | 114 | < 0.1% |
| 3 | 73 | < 0.1% |
| 4 | 40 | < 0.1% |
| 5 | 339 | < 0.1% |
| 6 | 39 | < 0.1% |
| 7 | 37 | < 0.1% |
| 8 | 38 | < 0.1% |
| 9 | 24 | < 0.1% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 999 | 8815 | 0.3% |
| 529 | 1 | < 0.1% |
| 527 | 1 | < 0.1% |
| 522 | 1 | < 0.1% |
| 520 | 1 | < 0.1% |
| 519 | 1 | < 0.1% |
| 518 | 2 | < 0.1% |
| 517 | 3 | < 0.1% |
| 516 | 3 | < 0.1% |
| 515 | 7 | < 0.1% |
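IDADE does not read as a plain age in years: the quartiles are 454, 469 and 481, and 8815 records carry the value 999. This pattern suggests a unit-prefixed code. The sketch below decodes it under an assumed layout in which the first digit gives the unit (4 = years, 5 = 100 years or more, 0 to 3 = minutes, hours, days, months) and 999 marks an unknown age. The layout is an assumption and is not stated in the report, so it should be checked against the data dictionary before use; `decode_idade` is an illustrative helper.

```python
def decode_idade(code):
    """Return the age in years under the assumed unit-prefix layout, or None."""
    if code is None or code != code:      # None or NaN
        return None
    code = int(code)
    if code == 999:                       # assumption: 999 marks an unknown age
        return None
    unit, amount = divmod(code, 100)
    if unit == 4:                         # assumed: amount is in years
        return float(amount)
    if unit == 5:                         # assumed: 100 years plus amount
        return 100.0 + amount
    # Assumed sub-year units: 0 = minutes, 1 = hours, 2 = days, 3 = months.
    per_year = {0: 525_600, 1: 8_760, 2: 365, 3: 12}
    if unit in per_year:
        return amount / per_year[unit]
    return None                           # any other prefix is left undecoded

for code in (469, 515, 301, 999):
    print(code, decode_idade(code))
```

Under this reading the median code 469 corresponds to 69 years, and the reported mean and standard deviation of the raw codes should not be interpreted as ages.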
LOCOCOR
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.5485047 |
| Minimum | 1 |
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.0428696 |
|---|---|
| Coefficient of variation (CV) | 0.6734688 |
| Kurtosis | 4.6988093 |
| Mean | 1.5485047 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.0291656 |
| Sum | 5013425 |
| Variance | 1.0875771 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2400433 | 74.1% |
| 3 | 480252 | 14.8% |
| 2 | 178466 | 5.5% |
| 4 | 92684 | 2.9% |
| 5 | 81809 | 2.5% |
| 9 | 3947 | 0.1% |
| (Missing) | 2 | < 0.1% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2400433 | 74.1% |
| 2 | 178466 | 5.5% |
| 3 | 480252 | 14.8% |
| 4 | 92684 | 2.9% |
| 5 | 81809 | 2.5% |
| 9 | 3947 | 0.1% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 9 | 3947 | 0.1% |
| 5 | 81809 | 2.5% |
| 4 | 92684 | 2.9% |
| 3 | 480252 | 14.8% |
| 2 | 178466 | 5.5% |
| 1 | 2400433 | 74.1% |
NATURAL
Real number (ℝ)
MISSING 
| Distinct | 237 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 957086 |
| Missing (%) | 29.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 803.19575 |
| Minimum | 0 |
| Maximum | 999 |
| Zeros | 389 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 800 |
| Q1 | 829 |
| median | 835 |
| Q3 | 835 |
| 95-th percentile | 835 |
| Maximum | 999 |
| Range | 999 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 141.68118 |
|---|---|
| Coefficient of variation (CV) | 0.17639682 |
| Kurtosis | 18.802808 |
| Mean | 803.19575 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -4.5088083 |
| Sum | 1.8316935 × 10⁹ |
| Variance | 20073.556 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 835 | 1394096 | 43.1% |
| 831 | 204051 | 6.3% |
| 829 | 146988 | 4.5% |
| 826 | 91955 | 2.8% |
| 800 | 82237 | 2.5% |
| 841 | 51469 | 1.6% |
| 827 | 39458 | 1.2% |
| 823 | 36739 | 1.1% |
| 190 | 31927 | 1.0% |
| 825 | 27027 | 0.8% |
| Other values (227) | 174560 | 5.4% |
| (Missing) | 957086 | 29.6% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 389 | < 0.1% |
| 2 | 8 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 55 | < 0.1% |
| 6 | 7 | < 0.1% |
| 8 | 2129 | 0.1% |
| 9 | 2 | < 0.1% |
| 10 | 335 | < 0.1% |
| 11 | 12 | < 0.1% |
| 12 | 3 | < 0.1% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 999 | 10602 | 0.3% |
| 998 | 4 | < 0.1% |
| 853 | 631 | < 0.1% |
| 852 | 3338 | 0.1% |
| 851 | 1951 | 0.1% |
| 850 | 5321 | 0.2% |
| 843 | 6894 | 0.2% |
| 842 | 5965 | 0.2% |
| 841 | 51469 | 1.6% |
| 835 | 1394096 | 43.1% |
OCUP
Real number (ℝ)
MISSING 
| Distinct | 2220 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 760072 |
| Missing (%) | 23.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 826337.31 |
| Minimum | 0 |
| Maximum | 999994 |
| Zeros | 57 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 231205 |
| Q1 | 715210 |
| median | 999992 |
| Q3 | 999993 |
| 95-th percentile | 999993 |
| Maximum | 999994 |
| Range | 999994 |
| Interquartile range (IQR) | 284783 |
Descriptive statistics
| Standard deviation | 265871.13 |
|---|---|
| Coefficient of variation (CV) | 0.32174649 |
| Kurtosis | 0.49599816 |
| Mean | 826337.31 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -1.3292233 |
| Sum | 2.0472681 × 10¹² |
| Variance | 7.068746 × 10¹⁰ |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 999993 | 828601 | 25.6% |
| 999992 | 559932 | 17.3% |
| 998999 | 113908 | 3.5% |
| 715210 | 68630 | 2.1% |
| 141410 | 53783 | 1.7% |
| 512105 | 39145 | 1.2% |
| 782305 | 33212 | 1.0% |
| 621005 | 27681 | 0.9% |
| 999991 | 26568 | 0.8% |
| 999994 | 26185 | 0.8% |
| Other values (2210) | 699876 | 21.6% |
| (Missing) | 760072 | 23.5% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 57 | < 0.1% |
| 10105 | 10 | < 0.1% |
| 10110 | 7 | < 0.1% |
| 10115 | 2 | < 0.1% |
| 10205 | 98 | < 0.1% |
| 10210 | 131 | < 0.1% |
| 10215 | 47 | < 0.1% |
| 10305 | 21 | < 0.1% |
| 10310 | 3121 | 0.1% |
| 10315 | 34 | < 0.1% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 999994 | 26185 | 0.8% |
| 999993 | 828601 | 25.6% |
| 999992 | 559932 | 17.3% |
| 999991 | 26568 | 0.8% |
| 998999 | 113908 | 3.5% |
| 992225 | 1914 | 0.1% |
| 992220 | 15 | < 0.1% |
| 992215 | 1 | < 0.1% |
| 992210 | 129 | < 0.1% |
| 992205 | 287 | < 0.1% |
RACACOR
Real number (ℝ)
MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 130117 |
| Missing (%) | 4.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.5957735 |
| Minimum | 1 |
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 4 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.1350783 |
|---|---|
| Coefficient of variation (CV) | 0.71130288 |
| Kurtosis | 0.53122662 |
| Mean | 1.5957735 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.5380467 |
| Sum | 4958828 |
| Variance | 1.2884028 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2354375 | 72.7% |
| 4 | 525380 | 16.2% |
| 2 | 182874 | 5.6% |
| 3 | 43535 | 1.3% |
| 5 | 1307 | < 0.1% |
| 9 | 5 | < 0.1% |
| (Missing) | 130117 | 4.0% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2354375 | 72.7% |
| 2 | 182874 | 5.6% |
| 3 | 43535 | 1.3% |
| 4 | 525380 | 16.2% |
| 5 | 1307 | < 0.1% |
| 9 | 5 | < 0.1% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 9 | 5 | < 0.1% |
| 5 | 1307 | < 0.1% |
| 4 | 525380 | 16.2% |
| 3 | 43535 | 1.3% |
| 2 | 182874 | 5.6% |
| 1 | 2354375 | 72.7% |
SEXO
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.7 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3237593 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1797069 | 55.5% |
| 2 | 1439711 | 44.5% |
| 0 | 687 | < 0.1% |
| 9 | 126 | < 0.1% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1797069 | 55.5% |
| 2 | 1439711 | 44.5% |
| 0 | 687 | < 0.1% |
| 9 | 126 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1797069 | |
| 2 | 1439711 | |
| 0 | 687 | < 0.1% |
| 9 | 126 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3237593 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 1797069 | |
| 2 | 1439711 | |
| 0 | 687 | < 0.1% |
| 9 | 126 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3237593 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 1797069 | |
| 2 | 1439711 | |
| 0 | 687 | < 0.1% |
| 9 | 126 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3237593 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 1797069 | |
| 2 | 1439711 | |
| 0 | 687 | < 0.1% |
| 9 | 126 | < 0.1% |
SUICIDIO
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.7 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3237593 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3212973 | 99.2% |
| 1 | 24620 | 0.8% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3212973 | 99.2% |
| 1 | 24620 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3212973 | |
| 1 | 24620 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3237593 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3212973 | |
| 1 | 24620 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3237593 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3212973 | |
| 1 | 24620 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3237593 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3212973 | |
| 1 | 24620 | 0.8% |
Correlations
| | ASSISTMED | DTOBITO | ESC | ESTCIV | IDADE | LOCOCOR | NATURAL | OCUP | RACACOR | SEXO | SUICIDIO |
|---|---|---|---|---|---|---|---|---|---|---|---|
| ASSISTMED | 1.000 | -0.001 | 0.059 | -0.046 | -0.111 | 0.462 | 0.010 | -0.012 | 0.056 | 0.081 | 0.160 |
| DTOBITO | -0.001 | 1.000 | -0.000 | 0.002 | 0.003 | -0.002 | 0.001 | 0.003 | -0.000 | 0.002 | 0.000 |
| ESC | 0.059 | -0.000 | 1.000 | -0.009 | -0.200 | 0.038 | 0.143 | -0.188 | -0.068 | 0.093 | 0.057 |
| ESTCIV | -0.046 | 0.002 | -0.009 | 1.000 | 0.373 | -0.038 | -0.003 | 0.141 | -0.107 | 0.209 | 0.072 |
| IDADE | -0.111 | 0.003 | -0.200 | 0.373 | 1.000 | -0.038 | -0.116 | 0.318 | -0.169 | 0.093 | 0.016 |
| LOCOCOR | 0.462 | -0.002 | 0.038 | -0.038 | -0.038 | 1.000 | 0.009 | -0.022 | 0.028 | 0.072 | 0.125 |
| NATURAL | 0.010 | 0.001 | 0.143 | -0.003 | -0.116 | 0.009 | 1.000 | -0.016 | -0.141 | 0.030 | 0.010 |
| OCUP | -0.012 | 0.003 | -0.188 | 0.141 | 0.318 | -0.022 | -0.016 | 1.000 | -0.044 | 0.229 | 0.048 |
| RACACOR | 0.056 | -0.000 | -0.068 | -0.107 | -0.169 | 0.028 | -0.141 | -0.044 | 1.000 | 0.032 | 0.013 |
| SEXO | 0.081 | 0.002 | 0.093 | 0.209 | 0.093 | 0.072 | 0.030 | 0.229 | 0.032 | 1.000 | 0.042 |
| SUICIDIO | 0.160 | 0.000 | 0.057 | 0.072 | 0.016 | 0.125 | 0.010 | 0.048 | 0.013 | 0.042 | 1.000 |
First rows
| | ASSISTMED | DTOBITO | ESC | ESTCIV | HORAOBITO | IDADE | LOCOCOR | NATURAL | OCUP | RACACOR | SEXO | SUICIDIO |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | NaN | 9022006 | 9.0 | 4.0 | 130.0 | 463.0 | 3.0 | NaN | 999993.0 | 1.0 | 1 | 0 |
| 1 | NaN | 26012006 | 2.0 | 2.0 | 1130.0 | 481.0 | 3.0 | NaN | 214305.0 | NaN | 1 | 0 |
| 2 | 1.0 | 19032006 | 2.0 | 3.0 | 1520.0 | 493.0 | 3.0 | NaN | 514105.0 | 1.0 | 2 | 0 |
| 3 | 1.0 | 21112006 | 2.0 | 1.0 | 1000.0 | 489.0 | 3.0 | 77.0 | 214305.0 | 1.0 | 1 | 0 |
| 4 | NaN | 16042006 | 9.0 | 3.0 | 2130.0 | 480.0 | 3.0 | NaN | NaN | 1.0 | 2 | 0 |
| 5 | NaN | 7052006 | 3.0 | 3.0 | 2015.0 | 468.0 | 3.0 | NaN | NaN | 1.0 | 1 | 0 |
| 6 | 2.0 | 28012006 | 9.0 | 3.0 | 2230.0 | 487.0 | 3.0 | NaN | 214305.0 | 4.0 | 2 | 0 |
| 7 | NaN | 13122006 | NaN | NaN | 2020.0 | 999.0 | 9.0 | 835.0 | NaN | NaN | 1 | 0 |
| 8 | NaN | 28052006 | 2.0 | 2.0 | 1706.0 | 450.0 | 5.0 | 835.0 | 512120.0 | 1.0 | 1 | 0 |
| 9 | 1.0 | 6042006 | NaN | 3.0 | 820.0 | 487.0 | 3.0 | 835.0 | 999992.0 | 3.0 | 2 | 0 |
Last rows
| | ASSISTMED | DTOBITO | ESC | ESTCIV | HORAOBITO | IDADE | LOCOCOR | NATURAL | OCUP | RACACOR | SEXO | SUICIDIO |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3237583 | 1.0 | 21102017 | 9.0 | 9.0 | NaN | 482.0 | 1.0 | 999.0 | 999993.0 | NaN | 1 | 0 |
| 3237584 | NaN | 4112017 | 3.0 | 2.0 | 1515.0 | 491.0 | 2.0 | NaN | 999993.0 | 1.0 | 2 | 0 |
| 3237585 | NaN | 27092017 | 2.0 | 4.0 | 2035.0 | 474.0 | 3.0 | 829.0 | NaN | 1.0 | 1 | 0 |
| 3237586 | NaN | 7102017 | 9.0 | NaN | 419.0 | 486.0 | 3.0 | 835.0 | 999993.0 | NaN | 2 | 0 |
| 3237587 | NaN | 28062017 | 1.0 | 1.0 | NaN | 441.0 | 5.0 | 835.0 | 999993.0 | NaN | 2 | 0 |
| 3237588 | NaN | 13042017 | NaN | NaN | 1840.0 | 465.0 | 3.0 | 835.0 | NaN | 1.0 | 1 | 0 |
| 3237589 | NaN | 19042017 | 2.0 | 1.0 | 937.0 | 455.0 | 3.0 | 829.0 | 999993.0 | 4.0 | 1 | 0 |
| 3237590 | 1.0 | 10012017 | 2.0 | 3.0 | 615.0 | 487.0 | 2.0 | 835.0 | 999993.0 | 1.0 | 1 | 0 |
| 3237591 | 1.0 | 10082017 | 2.0 | 2.0 | 500.0 | 486.0 | 3.0 | 835.0 | 999993.0 | 1.0 | 1 | 0 |
| 3237592 | 2.0 | 6052017 | NaN | NaN | 1030.0 | 215.0 | 5.0 | 835.0 | NaN | 1.0 | 2 | 0 |
Most frequently occurring
| | ASSISTMED | DTOBITO | ESC | ESTCIV | IDADE | LOCOCOR | NATURAL | OCUP | RACACOR | SEXO | SUICIDIO | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1323 | 1.0 | 3122012 | 2.0 | 3.0 | 485.0 | 1.0 | NaN | 999992.0 | 1.0 | 2 | 0 | 5 |
| 4258 | 1.0 | 10062016 | NaN | NaN | 301.0 | 1.0 | 835.0 | NaN | 1.0 | 1 | 0 | 5 |
| 156 | 1.0 | 1052014 | 2.0 | 3.0 | 489.0 | 1.0 | 835.0 | 999993.0 | 1.0 | 2 | 0 | 4 |
| 354 | 1.0 | 1092012 | 2.0 | 3.0 | 484.0 | 1.0 | NaN | 999992.0 | 1.0 | 2 | 0 | 4 |
| 660 | 1.0 | 2052015 | 2.0 | 2.0 | 472.0 | 1.0 | 835.0 | 999992.0 | 1.0 | 2 | 0 | 4 |
| 722 | 1.0 | 2072012 | 2.0 | 3.0 | 485.0 | 1.0 | NaN | 999992.0 | 1.0 | 2 | 0 | 4 |
| 892 | 1.0 | 2122007 | 2.0 | 2.0 | 472.0 | 1.0 | 835.0 | 999992.0 | 1.0 | 2 | 0 | 4 |
| 902 | 1.0 | 2122012 | 3.0 | 2.0 | 470.0 | 1.0 | NaN | 999993.0 | 1.0 | 1 | 0 | 4 |
| 1286 | 1.0 | 3112012 | 2.0 | 3.0 | 486.0 | 1.0 | NaN | 999992.0 | 1.0 | 2 | 0 | 4 |
| 1527 | 1.0 | 4052017 | NaN | NaN | 301.0 | 1.0 | 835.0 | NaN | 1.0 | 1 | 0 | 4 |